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^ (57) Abstract: Synthetic DNA molecules encoding the HPV31 LI protein are provided. Specifically, the present invention provides 
O polynucleotides encoding HPV31 LI protein, wherein said polynucleotides are free from internal transcription termination signals 
O that are recognized by yeast. Also provided are synthetic polynucleotides encoding HPV31 LI wherein the polynucleotides have 
^ been codon-optimized for high level expression in a yeast cell. The synthetic molecules may be used to produce HPV31 virus- 
Q like particles (VLPs), and to produce vaccines and pharmaceutical compositions comprising the HPV31 VLPs. The vaccines of 

the present invention provide effective immunoprophylaxis against papillomavirus infection through neutralizing antibody and cell- 

mediated immunity. 
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TITLE OF THE INVENTION 

OPTIMIZED EXPRESSION OF HPV 31 LI IN YEAST 

CROSS-REFERENCE TO RELATED APPLICATIONS 
5 This application claims the benefit of U.S. Provisional Application No. 60/457,172 filed 

March 24, 2003, the contents of which are incorporated herein by reference in their entirety. 

FIELD OF THE INVENTION 

The present invention relates generally to the therapy of human papillomavirus (HPV). 
10 More specifically, the present invention relates to synthetic polynucleotides encoding HPV3 1 LI protein, 
and to recombinant vectors and hosts comprising said polynucleotides. This invention also relates to 
HPV3 1 virus-like particles (VLPs) and to their use in vaccines and pharmaceutical compositions for 
preventing and treating HPV. 

1 5 BACKGROUND OF THE INVENTION 

There are more than 80 types of human papillomavirus (HPV), many of which have been 
associated with a wide variety of biological phenotypes, from benign proliferative warts to malignant 
carcinomas (for review, see McMurray et al, hit J. Exp. Pathol 82(1): 15-33 (2001)). HPV6 and 
HP VI 1 are the types most commonly associated with benign warts, nonmalignant condylomata 

20 acuminate and/or low-grade dysplasia of the genital or respiratory mucosa. HPV16 and HPV18 are the 
high-risk types most frequently associated with in situ and invasive carcinomas of the cervix, vagina, 
vulva and anal canal. More than 90% of cervical carcinomas are associated with infections of HPV16, 
HPV18 or the less prevalent oncogenic types HPV31, -33, -45, -52 and -58 (Schiffinan et al., 1 Natl 
Cancerlnst. 85(12): 958-64(1993)). The observation that HPV DNA is detected in 90-100% of cervical 

25 cancers provides strong epidemiological evidence that HPVs cause cervical carcinoma {see Bosch et al., 
X Clin. Pathol 55: 244-265 (2002)). 

Papillomaviruses are small (50-60 nm), nonenveloped, icosahedral DNA viruses that 
encode up to eight early and two late genes. The open reading frames (ORFs) of the viral genomes are 
designated El to E7, and LI and L2, where "E" denotes early and "L" denotes late. LI and L2 code for 

30 virus capsid proteins, while the E genes are associated with functions such as viral replication and cellular 
transformation. 

The LI protein is the major capsid protein and has a molecular weight of 55-60 kDa. The 
L2 protein is a minor capsid protein. Immunological data suggest that most of the L2 protein is internal 
to the LI protein. Both the LI and L2 proteins are highly conserved among different papillomaviruses. 

-1- 
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Expression of the LI protein or a combination of the LI and L2 proteins in yeast, insect 
cells, mammalian cells or bacteria leads to self-assembly of virus-like particles (VLPs) (for review, see 
Schiller and Roden, in Papillomavirus Reviews: Current Research on Papillomaviruses', Lacey, ed. 
Leeds, UK: Leeds Medical Information, pp 101-12 (1996)). VLPs are morphologically similar to 
5 authentic virions and are capable of inducing high titers of neutralizing antibodies upon administration 
into an animal or a human. Because VLPs do not contain the potentially oncogenic viral genome, they 
present a safe alternative to use of live virus in HPV vaccine development (for review, see Schiller and 
Hidesheim, J. Clin. Virol 19: 67-74 (2000)). For this reason, the LI and L2 genes have been identified as 
immunological targets for the development of prophylactic and therapeutic vaccines for HPV infection 
10 and disease. 

HPV vaccine development and commercialization have been hindered by difficulties 
associated with obtaining high expressi<?n levels of capsid proteins in successfully transformed host 
organisms, limiting the production of purified protein. Therefore, despite the identification of wild-type 
nucleotide sequences encoding HPV LI proteins such as HPV31 LI proteins (Goldsborough et al., 
15 Virology 171(1): 306-31 1 (1989), it would be highly desirable to develop a readily renewable source of 
crude HPV proteins that utilizes HPV31 LI -encoding nucleotide sequences that are optimized for 
expression in the intended host cell. Additionally, it would be useful to produce large quantities of 
HPV3 1 LI VLPs having the immunity-conferring properties of the native proteins for use in vaccine 
development. 

20 

SUMMARY OF THE INVENTION 

The present invention relates to compositions and methods to elicit or enhance immunity 
to the protein products expressed by HPV31 LI genes, which have been associated with cervical cancer. 
Specifically, the present invention provides polynucleotides encoding HPV31 LI protein, wherein said 

25 polynucleotides are free from internal transcription termination signals that are recognized by yeast. Also 
provided are synthetic polynucleotides encoding HPV31 LI wherein the polynucleotides have been 
codon-optimized for high level expression in a yeast cell. The present invention further provides HPV3 1 
virus-like particles (VLPs) and discloses use of said VLPs in immunogenic compositions and vaccines for 
the prevention and/or treatment of HPV disease or HPV-associated cancer. 

30 The present invention relates to synthetic DNA molecules encoding the HPV31 LI 

protein. In one aspect of the invention, the nucleotide sequence of the synthetic molecule is altered to 
eliminate transcription termination signals that are recognized by yeast. In another aspect, the codons of 
the synthetic molecules are designed so as to use the codons preferred by a yeast cell. The synthetic 

-2- 
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molecules may be used as a source of HPV31 LI protein, which may self-assemble into VLPs. Said 
VLPs may be used in a VLP-based vaccine. 

A particular embodiment of the present invention comprises a synthetic nucleic acid 
molecule which encodes the HPV31 LI protein as set forth in SEQ ID NO:4, said nucleic acid molecule 
5 comprising a sequence of nucleotides as set forth in SEQ ID NO:2 or SEQ ID NO:3. 

As stated above, provided herein are synthetic polynucleotides encoding the HPV31 LI 
gene which are free from transcription termination signals that are recognized by yeast. This invention 
also provides synthetic polynucleotides encoding HPV 31 LI as described, which are further altered so as 
to contain codons that are preferred by yeast cells. 
1 0 Also provided are recombinant vectors and recombinant host cells, both prokaryotic and 

eukaryotic, which contain the nucleic acid molecules disclosed throughout this specification. 

The present invention relates to a process for expressing an HPV31 LI protein in a 
recombinant host cell, comprising: (a) introducing a vector comprising a nucleic acid encoding an HPV31 
LI protein into a yeast host cell; wherein the nucleic acid molecule is free from internal transcription 
15 termination signals that are recognized by yeast and; (b) culturing the yeast host cell under conditions 
which allow expression of said HPV31 LI protein. 

The present invention further relates to a process for expressing an HPV31 LI protein in 
a recombinant host cell, comprising: (a) introducing a vector comprising a nucleic acid encoding an 
HPV31 LI protein into a yeast host cell; wherein the nucleic acid molecule is codon-optimized for 
20 optimal expression in the yeast host cell and; (b) culturing the yeast host cell under conditions which 
allow expression of said HPV31 LI protein. 

In preferred embodiments, the nucleic acid comprises a sequence of nucleotides as set 
forth in SEQ ID NO:2 or SEQ ID NO:3. 

This invention also relates to HPV3 1 virus-like particles (VLPs), methods of producing 
25 HPV3 1 VLPs, and methods of using HPV3 1 VLPs. 

In a preferred embodiment of the invention, the HPV3 1 VLPs are produced in yeast. In a 
further preferred embodiment, the yeast is selected from the group consisting of: Saccharomyces 
cerevisiae, Hansenala polymorphs Pichia pastoris, Kluyvermyces fragilis, Kluveromyces lactis, and 
Schizosaccharomyces pombe. 
30 Another aspect of this invention is an HPV3 1 VLP, which comprises an HPV3 1 LI 

protein produced by a HPV3 1 LI gene which is free from transcription termination signals that are 
recognized by yeast. 

Yet another aspect of this invention is an HPV3 1 VLP, which comprises an HPV3 1 LI 
protein produced by a codon-optimized HPV3 1 LI gene. In a preferred embodiment of this aspect of the 

-3- 
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invention, the codon-optimized HPV31 LI gene consists essentially of a sequence of nucleotides as set 
forth in SEQ ID NO:2 or SEQ ID NO:3. 

This invention also provides a method for inducing an immune response in an animal 
comprising administering HPV31 virus-like particles to the animal. In a preferred embodiment, the 
5 HP V3 1 VLPs are produced by a codon-optimized gene. In a further preferred embodiment, the HPV3 1 
VLPs are produced by a gene that is free from transcription termination sequences that are recognized by 
yeast. 

Yet another aspect of this invention is a method of preventing or treating HPV-associated 
cervical cancer comprising administering to a mammal a vaccine comprising HPV3 1 VLPs. In a 
10 preferred embodiment of this aspect of the invention, the HPV3 1 VLPs are produced in yeast. 

This invention also relates to a vaccine comprising HPV31 virus-like particles (VLPs). 
In an alternative embodiment of this aspect of the invention, the vaccine further 
comprises VLPs of at least one additional HP V type. In a preferred embodiment, the at least one 
additional HPV type is selected from the group consisting of: HPV6, HPV11, HPV16, HPV18, HPV33, 
15 HPV35, HPV39, HPV45, HPV51, HPV52, HPV55, HPV56, HPV58, HPV59, and HPV68. 

This invention also relates to pharmaceutical compositions comprising HPV 31 virus-like 
particles. Further, this invention relates to pharmaceutical compositions comprising HPV31 VLPs and 
VLPs of at least one additional HPV type. In a preferred embodiment, the at least one additional HPV 
type is selected from the group consisting of: HPV6, HPV1 1, HP VI 6, HPV 18, HPV33, HPV35, HPV39, 
20 HPV45, HPV51, HPV52, HPV55, HPV56, HPV58, HPV59, andHPV68. 

As used throughout the specification and in the appended claims, the singular forms "a," 
"an," and "the" include the plural reference unless the context clearly dictates otherwise. 

As used throughout the specification and appended claims, the following definitions and 
25 abbreviations apply: 

The term "promoter" refers to a recognition site on a DNA strand to which the RNA 
polymerase binds. The promoter forms an initiation complex with RNA polymerase to initiate and drive 
transcriptional activity. The complex can be modified by activating sequences termed "enhancers" or 
"upstream activating sequences" or inhibiting sequences termed "silencers". 
30 The term "vector" refers to some means by which DNA fragments can be introduced into 

a host organism or host tissue. There are various types of vectors including plasmid, virus (including 
adenovirus), bacteriophages and cosmids. 

The designation "31 LI wild-type sequence" refers to the HPV31 LI sequence disclosed 
herein as SEQ ID NO:l. Although the HPV 31 LI wild-type sequence has been described previously, it 

-4- 
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is not uncommon to find minor sequence variations between DNAs obtained from clinical isolates. 
Therefore, a representative HPV31 LI wild-type sequence was isolated from clinical samples previously 
shown to contain HPV 31 DNA (see EXAMPLE 1). The 31 LI wild-type sequence was used as a 
reference sequence to compare the codon-optimized HPV 31 LI sequences disclosed herein (see FIGURE 
1). 

The designation "31 LI partial rebuild" refers to a construct, disclosed herein (SEQ ID 
NO:2), in which the HPV31 LI nucleotide sequence was partially rebuilt to contain yeast-preferred 
codons for optimal expression in yeast. The 31 LI partial rebuild comprises alterations in the middle 
portion of the HPV 31 LI wild-type nucleotide sequence (nucleotides 697-1249). The complete HPV 31 
LI sequence was also rebuilt with yeast-preferred codons, which is referred to herein as the "31 LI total 
rebuild" (SEQIDNO:3). 

The term "effective amount" means sufficient vaccine composition is introduced to 
produce the adequate levels of the polypeptide, so that an immune response results. One skilled in the art 
recognizes that this level may vary. 

A "conservative amino acid substitution" refers to the replacement of one amino acid 
residue by another, chemically similar, amino acid residue. Examples of such conservative substitutions 
are: substitution of one hydrophobic residue (isoleucine, leucine, valine, or methionine) for another; 
substitution of one polar residue for another polar residue of the same charge (e.g., arginine for lysine; 
glutamic acid for aspartic acid). 

The term "mammalian" refers to any mammal, including a human being. 

"VLP" or "VLPs" mean(s) virus-like particle or virus-like particles. 

"Synthetic" means that the HPV31 LI gene has been modified so that it contains a 
sequence of nucleotides that is not the same as the sequence of nucleotides present in the naturally 
occurring wild-type HPV31 LI gene. As stated above, synthetic molecules are provided herein 
comprising a sequence of nucleotides that are altered to eliminate transcription termination signals 
recognized by yeast. Also provided herein are synthetic molecules comprising codons that are preferred 
for expression by yeast cells. The synthetic molecules provided herein encode the same amino acid 
sequences as the wild-type HPV3 1 LI gene. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 is a sequence alignment showing nucleotides that were altered in the partial 
(SEQ ID NO:2) and total rebuild (SEQ ID NO:3) 31 LI genes (See EXAMPLE 2). The reference 
sequence is the 31 LI wild-type sequence (SEQ ID NO:l; see EXAMPLE 1). Nucleotides in the 31 LI 
partial and total rebuild sequences that are identical to the reference sequence are indicated with dots. 
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Altered nucleotides are indicated at their corresponding location. Nucleotide number is contained within 
the parentheses. 

FIGURE 2 shows the 31 LI total rebuild nucleotide (SEQ ID NO:3) and amino acid 
sequences (SEQ ED NO:4). The nucleotide number is indicated on the left. 
5 FIGURE 3 summarizes the changes between the three HPV 3 1 LI sequence constructs, 

which are listed on the left. The fourth column indicates the percent nucleotide identity between the 
indicated construct and the 31 LI wild-type sequence and the fifth column indicates the amino acid 
identity. The last column indicates the number of nucleotides that were altered to yeast-preferred codon 
sequences and the region where the alterations were made. 

1 0 FIGURE 4 shows a Northern blot probed specifically for HPV 3 1 LI under high 

stringency (see EXAMPLE 4). Arrows on the left indicate the position of the HPV 31 LI full length and 
truncated transcripts. Lanes labeled "3 1 wt" are from the same RNA preparation of yeast containing 3 1 
LI wild-type sequences. The lane labeled "16" contains RNA from HPV 16, which is not recognized by 
the HPV 3 1 LI probe because of the high stringency conditions. The lane labeled "Neg" is a yeast extract 

1 5 containing no LI coding sequences. Lanes labeled "3 1 R" are from RNA of two separate isolated 
colonies expressing the 31 LI partial rebuild sequence. 

FIGURE 5 shows a portion of the data from two capture radioimmunoassay (RIA) 
experiments in counts per minute (cpm)/mg total protein (see EXAMPLE 7). Cpm obtained in the RIA 
are a relative indicator of HPV 31 LI VLPs. The RIA data demonstrate increased 31 LI VLP expression 

20 in yeast protein extracts from yeast-preferred codon rebuilt gene sequences. 

FIGURE 6 shows a representative sample of the 31 LI VLPs described herein, as 
visualized by transmission electron microscopy (see EXAMPLE 8). The bar represents 100 nm. 

DETAILED DESCRIPTION OF THE INVENTION 

25 The majority of cervical carcinomas are associated with infections of specific oncogenic 

types of human papillomavirus (HPV). The present invention relates to compositions and methods to 
elicit or enhance immunity to the protein products expressed by genes of oncogenic HPV types. 
Specifically, the present invention provides polynucleotides encoding HPV31 LI and HPV31 virus-like 
particles (VLPs) and discloses use of said polynucleotides and VLPs in immunogenic compositions and 

30 vaccines for the prevention and/or treatment of HPV-associated cancer. 

The wild-type HPV31 LI nucleotide sequence has been reported (Goldsborough et al., 
Virology 171(1): 306-31 1 (1989); Genbank Accession # J04353). The present invention provides 
synthetic DNA molecules encoding the HPV31 LI protein. The synthetic molecules of the present 
invention comprise a sequence of nucleotides, wherein some of the nucleotides have been altered so as to 

-6-. 
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eliminate transcription termination signals that are recognized by yeast. In alternative embodiments, the 
codons of the synthetic molecules are designed so as to use the codons preferred by a yeast cell for high- 
level expression. The synthetic molecules may be used as a source of HPV31 LI protein, which may 
self-assemble into VLPs. Said VLPs may be used in a VLP-based vaccine to provide effective 
5 immunoprophylaxis against papillomavirus infection through neutralizing antibody and cell-mediated 
immunity. Such VLP-based vaccines are also useful for treatment of already established HPV infections. 

Expression of HPV VLPs in yeast cells offers the advantages of being cost-effective and 
easily adapted to large-scale growth in fermenters. However, many HPV LI proteins, including HPV31 
LI (see EXAMPLE 4), are expressed at low levels in yeast cells. It has been determined in accordance 

10 with the present invention that low level expression of HPV3 1 LI is due to truncation of the mRNA 
transcript resulting from the presence of transcription termination signals that are recognized by yeast. 
By altering the HPV31 LI DNA to eliminate any potential sequences resembling yeast transcription 
termination sites, it is possible to facilitate the transcription of full-length mRNA resulting in increased 
HPV31 LI protein expression. 

15 Accordingly, in some embodiments of this invention, alterations have been made to the 

HPV31 LI DNA to eliminate any potential sequences resembling yeast transcription termination signals. 
These alterations allow expression of the full-length HPV31 transcript, as opposed to a truncated 
transcript (see EXAMPLE 4), improving expression yield. 

As noted above, synthetic DNAs of the present invention comprise alterations from the 

20 wild-type HPV3 1 LI sequence that were made to eliminate yeast-recognized transcription termination 
sites. One of skill in the art will recognize that additional DNA molecules can be constructed that encode 
the HPV31 LI protein, but do not contain yeast transcription termination sites. Techniques for finding 
yeast transcription termination sequences are well known in the art. Transcription termination and 3' end 
formation of yeast mRNAs requires the presence of three signals: (1) an efficiency element such as 

25 TATATA or related sequences, which enhances the efficiency of positioning elements located 

downstream; (2) positioning element(s), which determine the location of the poly(A) site and (3) the 
polyadenylation site (usually Py(A)n). 

The scientific literature is replete with descriptions of sequences that encode yeast 
transcription termination signals. See, for example, Guo and Sherman, Trends Biochem. Sci. 21: 477-481 

30 (1986); Guo and Sherman, Mol Cell Biol 16(6): 2772-2776 (1996); Zaret et al, Cell 28:563-573 (1982); 
Henikoff et al, Cell 33:607-614 (1983); Thalenfeld et al, J. Biol. Chem. 258(23): 14065-14068 (1983); 
Zaret etal,/. Mol Biol 176:107-135 (1984); Heidmann et al, Afo/. CellBiol 14:4633-4642 (1984); and 
Russo, Yeast 11:447-453 (1985). Therefore, one of skill in the art would have no difficulty determining 
which sequences to avoid in order to construct a synthetic HPV3 1 LI gene that produces a full-length 
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mRNA transcript in accordance with the present invention. Additionally, assays and procedures to assess 
whether a yeast transcription termination sequence is present within the synthetic sequence are well 
established in the art, so that an ordinary skilled artisan would be able to determine if a constructed 
HPV3 1 LI sequence comprises termination sequences that need to be eliminated. 
5 As described above, the present invention relates to a nucleic acid molecule encoding 

HPV type 31 LI protein, the nucleic acid molecule being free from internal transcription termination 
signals which are recognized by yeast. In exemplary embodiments of the invention, the synthetic nucleic 
acid molecules comprise a sequence of nucleotides as set forth in SEQ ED NO:2 or SEQ ID NO:3. 

In alternative embodiments of the present invention, HPV31 LI gene sequences are 

10 "optimized" for high level expression in a yeast cellular environment. Codon-optimized HPV31 LI genes 
contemplated by the present invention include synthetic molecules encoding HPV31 LI that are free from 
internal transcription termination signals which are recognized by yeast, further comprising at least one 
codon that is codon-optimized for high level expression in yeast cells. 

A "triplet" codon of four possible nucleotide bases can exist in over 60 variant forms. 

15 Because these codons provide the message for only 20 different amino acids (as well as transcription 
initiation and termination), some amino acids can be coded for by more than one codon, a phenomenon 
known as codon redundancy. For reasons not completely understood, alternative codons are not 
uniformly present in the endogenous DNA of differing types of cells. Indeed, there appears to exist a 
variable natural hierarchy or "preference" for certain codons in certain types of cells. As one example, 

20 the amino acid leucine is specified by any of six DNA codons including CTA, CTC, CTG, CTT, TTA, 
and TTG. Exhaustive analysis of genome codon frequencies for microorganisms has revealed 
endogenous DNA of E. coli most commonly contains the CTG leucine-specifying codon, while the DNA 
of yeasts and slime molds most commonly includes a TTA leucine-specifying codon. In view of this 
hierarchy, it is generally believed that the likelihood of obtaining high levels of expression of a leucine- 

25 rich polypeptide by an E. coli host will depend to some extent on the frequency of codon.use. For 

example, it is likely that a gene rich in TTA codons will be poorly expressed in E. coli, whereas a CTG 
rich gene will probably be highly expressed in this host. Similarly, a preferred codon for expression of a 
leucine-rich polypeptide in yeast host cells would be TTA. 

The implications of codon preference phenomena on recombinant DNA techniques are 

30 manifest, and the phenomenon may serve to explain many prior failures to achieve high expression levels 
of exogenous genes in successfully transformed host organisms-a less "preferred" codon may be 
repeatedly present in the inserted gene and the host cell machinery for expression may not operate as 
efficiently. This phenomenon suggests that synthetic genes which have been designed to include a 
projected host cell's preferred codons provide an optimal form of foreign genetic material for practice of 
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recombinant DNA techniques. Thus, one aspect of this invention is an HPV3 1 LI gene that is codon- 
optimized for expression in a yeast cell. In a preferred embodiment of this invention, it has been found 
that the use of alternative codons encoding the same protein sequence may remove the constraints on 
expression of HPV3 1 LI proteins by yeast cells. 
5 In accordance with this invention, HPV31 LI gene segments were converted to 

sequences having identical translated sequences but with alternative codon usage as described by Sharp 
and Cowe (Synonymous Codon Usage in Saccharomyces cerevisiae. Yeast 7: 657-678 (1991)), which is 
hereby incorporated by reference. The methodology generally consists of identifying codons in the wild- 
type sequence that are not commonly associated with highly expressed yeast genes and replacing them 

1 0 with optimal codons for high expression in yeast cells. The new gene sequence is then inspected for 
undesired sequences generated by these codon replacements (e.g., "ATTTA" sequences, inadvertent 
creation of intron splice recognition sites, unwanted restriction enzyme sites, etc.). Undesirable 
sequences are eliminated by substitution of the existing codons with different codons coding for the same 
amino acid. The synthetic gene segments are then tested for improved expression. 

1 5 The methods described above were used to create synthetic gene segments for HPV3 1 

LI, resulting in a gene comprising codons optimized for high level expression. While the above 
procedure provides a summary of our methodology for designing codon-optimized genes for use in HPV 
vaccines, it is understood by one skilled in the ait that similar vaccine efficacy or increased expression of 
genes may be achieved by minor variations in the procedure or by minor variations in the sequence. 

20 Accordingly, the present invention relates to a synthetic polynucleotide comprising a 

sequence of nucleotides encoding an HPV31 LI protein, or a biologically active fragment or mutant form 
of an HPV3 1 LI protein, the polynucleotide sequence comprising codons optimized for expression in a 
- yeast host. Said mutant forms of the HPV3 1 LI protein include, but are not limited to: conservative 
amino acid substitutions, amino-terminal truncations, carboxy-tenninal truncations, deletions, or 

25 additions. Any such biologically active fragment and/or mutant will encode either a protein or protein 
fragment which at least substantially mimics the immunological properties of the HPV3 1 LI protein as 
set forth in SEQ ID NO:4. The synthetic polynucleotides of the present invention encode mRNA 
molecules that express a functional HPV31 LI protein so as to be useful in the development of a 
therapeutic or prophylactic HPV vaccine. 

30 One aspect of this invention is a codon-optimized nucleic acid molecule which encodes 

the HPV31 LI protein as set forth in SEQ ID NO:4, said nucleic acid molecule comprising a sequence of 
nucleotides as set forth in SEQ ID NO:2. 
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Another aspect of this invention is a codon-optimized nucleic acid molecule which 
encodes the HPV31 LI protein as set forth in SEQ ID NO:4, said nucleic acid molecule comprising a 
sequence of nucleotides as set forth in SEQ ID NO:3. 

The present invention also relates to recombinant vectors and recombinant host cells, 
5 both prokaryotic and eukaryotic, which contain the nucleic acid molecules disclosed throughout this 
specification. 

The synthetic HPV31 DNA or fragments thereof constructed through the methods 
described herein may be recombinantly expressed by molecular cloning into an expression vector 
containing a suitable promoter and other appropriate transcription regulatory elements, and transferred 

10 into prokaryotic or eukaryotic host cells to produce recombinant HPV3 ILL Techniques for such 

manipulations are described in the art (Sambrook et al. Molecular Cloning: A Laboratory Manual; Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, New York, (1989); Current Protocols in Molecular 
Biology, Ausubel et al., Green Pub. Associates and Wiley-Interscience, New York (1988); Yeast 
Genetics: A Laboratory Course Manual, Rose et al., Cold Spring Harbor Laboratory, Cold Spring Harbor, 

15 New York, (1990), which are hereby incorporated by reference in their entirety). 

Thus, the present invention further relates to a process for expressing an HPV3 1 LI 
protein in a recombinant host cell, comprising: (a) introducing a vector comprising a nucleic acid 
encoding an HPV3 1 LI protein into a yeast host cell; wherein the nucleic acid molecule is codon- 
optimized for optimal expression in the yeast host cell and; (b) culturing the yeast host cell under 

20 conditions which allow expression of said HPV3 1 LI protein. 

The present invention also relates to a process for expressing an HPV31 LI protein in a 
recombinant host cell, comprising: (a) introducing a vector comprising a nucleic acid encoding an HPV3 1 
LI protein into a yeast host cell; wherein the nucleic acid molecule is free from internal transcription 
termination signals which are recognized by yeast and; (b) culturing the yeast host cell under conditions 

25 which allow expression of said HPV3 1 LI protein. 

This invention further relates to a process for expressing an HPV31 LI protein in a 
recombinant host cell, comprising: (a) introducing a vector comprising a nucleic acid as set forth in SEQ 
ID NO:2 or SEQ ID NO:3 into a yeast host cell; and, (b) culturing the host cell under conditions which 
allow expression of said HPV31 LI protein. 

30 The synthetic genes of the present invention can be assembled into an expression sette 

that comprises sequences designed to provide efficient expression of the HPV58 LI protein in the ;t 
cell. The cassette preferably contains the synthetic gene, with related transcriptional and translations 
control sequences operatively linked to it, such as a promoter, and termination sequences. In a preferred 
embodiment, the promoter is the S. cerevisiae GAL] promoter, although those skilled in the art will 
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recognize that any of a number of other known yeast promoters such as the GAL10, GAL7, ADHJ, TDH3 
or PGK promoters, or other eukaryotic gene promoters may be used, A preferred transcriptional 
terminator is the 5. cerevisiae ADH1 terminator, although other known transcriptional terminators may 
also be used. The combination of GAL1 promoter - ADH1 terminator is particularly preferred. 
5 Another aspect of this invention is an HPV3 1 virus-like particle (VLP), methods of 

producing HPV31 VLPs, and methods of using HPV31 VLPs. VLPs can self-assemble when LI, the 
major capsid protein of human and animal papillomaviruses, is expressed in yeast, insect cells, 
mammalian cells or bacteria (for review, see Schiller and Roden, in Papillomavirus Reviews: Current 
Research on Papillomaviruses', Lacey, ed, Leeds, UK: Leeds Medical Information, pp 101-12 (1996)). 

10 Morphologically indistinct HPV VLPs can also be produced by expressing a combination of the LI and 
L2 capsid proteins. VLPs are composed of 72 pentamers of LI in a T=7 icosahedral structure (Baker et 
al., Biophys. J. 60(6): 1445-56 (1991)). 

VLPs are morphologically similar to authentic virions and are capable of inducing high 
titres of neutralizing antibodies upon administration into an animal. Immunization of rabbits (Breitburd et 

15 al., J. Virol 69(6): 3959-63 (1995)) and dogs (Suzich et al., Proc. Natl. Acad. Sci. USA 92(25): 11553-57 
(1995)) with VLPs was shown to both induce neutralizing antibodies and protect against experimental 
papillomavirus infection. However, because the VLPs do not contain the potentially oncogenic viral 
genome arid can self-assemble from a single gene, they present a safe alternative to use of live virus in 
HPV vaccine development (for review, see Schiller and Hidesheim, J. Clin. Virol. 19: 67-74 (2000)). 

20 Thus, the present invention relates to virus-like particles comprised of recombinant LI 

protein or recombinant LI + L2 proteins of HPV31. 

In a preferred embodiment of the invention, the HPV3 1 VLPs are produced in yeast. In a 
further preferred embodiment, the yeast is selected from the group consisting of: Saccharomyces 
cerevisiae, Hansenula polymorpha, Pichia pastoris, Kluyvermyces fragilis, Kluveromyces lactis, and 

25 Schizosaccharomyces pombe. 

Another aspect of this invention is an HPV3 1 VLP, which comprises an HPV3 1 LI 
protein produced by a HPV3 1 LI gene that is free from internal transcription termination signals that are 
recognized by yeast. 

Yet another aspect of this invention is an HPV31 VLP which comprises an HPV31 LI 
30 protein produced by a codon-optimized HPV31 LI gene. In a preferred embodiment of this aspect of the 
invention, the codon-optimized HPV31 LI gene consists essentially of a sequence of nucleotides as set 
forth in SEQ ID NO:2 or SEQ ID NO:3. 

Yet another aspect of this invention is a method of producing HPV3 1 VLPs, comprising: 
(a) transforming yeast with a recombinant DNA molecule encoding HPV31 LI protein or HPV31 LI + 
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L2 proteins; (b) cultivating the transformed yeast under conditions that permit expression of the 
recombinant DNA molecule to produce the recombinant HPV31 protein; and (c) isolating the 
recombinant HPV31 protein to produce HPV31 VLPs. 

In a preferred embodiment of this aspect of the invention, the yeast is transformed with a 
5 HPV3 1 LI gene that is free from transcription termination signals that are recognized by yeast. In 
another preferred embodiment, the yeast is transformed with a codon-optimized HPV3 1 LI gene to 
produce HPV3 1 VLPs. In a particularly preferred embodiment, the codon-optimized HPV3 1 LI gene 
consists essentially of a sequence of nucleotides as set forth in SEQ ID NO:2 or SEQ ID NO:3. 

This invention also provides a method for inducing an immune response in an animal 
10 comprising administering HPV31 virus-like particles to the animal. In a preferred embodiment, the 

HP V3 1 VLPs are produced by a gene that is free from internal transcription termination sequences that 
are recognized by yeast. In a further preferred embodiment, the HPV3 1 VLPs are produced by a codon- 
optimized gene. 

Yet another aspect of this invention is a method of preventing or treating HPV-associated 
1 5 cervical cancer comprising administering to a mammal a vaccine comprising HPV3 1 VLPs. In a 
preferred embodiment of this aspect of the invention, the HPV3 1 VLPs are produced in yeast. 

This invention also relates to a vaccine comprising HPV31 virus-like particles (VLPs). 
In an alternative embodiment of this aspect of the invention, the vaccine further 
comprises VLPs of at least one additional HPV type. In a preferred embodiment, the at least one 
20 additional HPV type is selected from the group consisting of: HPV6, HPV1 1, HPV16, HPV18, HPV33, 
HPV35, HPV39, HPV45, HPV51, HPV52, HPV55, HPV56, HPV58, HPV59, and HPV68. 

In a preferred embodiment of this aspect of the invention, the vaccine further comprises 

HPV 16 VLPs. 

In another preferred embodiment of the invention, the vaccine further comprises HPV 16 
25 VLPs and HPV1 8 VLPs. 

In yet another preferred embodiment of the invention, the vaccine further comprises 
HPV6 VLPs, HPV1 1 VLPs, HPV 16 VLPs and HPV18 VLPs. 

This invention also relates to pharmaceutical compositions comprising HPV 31 virus-like 
particles. Further, this invention relates to pharmaceutical compositions comprising HPV3 1 VLPs and 
30 VLPs of at least one additional HPV type, hi a preferred embodiment, the at least one additional HPV 
type is selected from the group consisting of: HPV6, HPV11, HPV 16, HPV18, HPV33, HPV35, HPV39, 
HPV45, HPV51, HPV52, HPV55, HPV56, HPV58, HPV59, and HPV68. 

Vaccine compositions of the present invention may be used alone at appropriate dosages 
defined by routine testing in order to obtain optimal inhibition of HPV31 infection while minimizing any 
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potential toxicity. In addition, co-administration or sequential administration of other agents may be 
desirable. 

The amount of virus-like particles to be introduced into a vaccine recipient will depend 
on the immunogenicity of the expressed gene product In general, an immunologically or 
5 prophylactically effective dose of about 10 jig to 100 |ig, and preferably about 20 fig to 60 jig of VLPs is 
administered directly into muscle tissue. Subcutaneous injection, intradermal introduction, impression 
though the skin, and other modes of administration such as intraperitoneal, intravenous, or inhalation 
delivery are also contemplated. It is also contemplated that booster vaccinations may be provided. 
Parenteral administration, such as intravenous, intramuscular, subcutaneous or other means of 
10 administration with adjuvants such as alum or Merck alum adjuvant, concurrently with or subsequent to 
parenteral introduction of the vaccine of this invention is also advantageous. 

All publications mentioned herein are incorporated by reference for the purpose of 
describing and disclosing methodologies and materials that might be used in connection with the present 
15 invention. Nothing herein is to be construed as an admission that the invention is not entitled to antedate 
such disclosure by virtue of prior invention. 

Having described preferred embodiments of the invention with reference to the 
accompanying drawings, it is to be understood that the invention is not limited to those precise 
embodiments, and that various changes and modifications may be effected therein by one skilled in the art 
20 without departing from the scope or spirit of the invention as defined in the appended claims. 

The following examples illustrate, but do not limit the invention. 

EXAMPLE 1 

Determination of a representative HPV 31 LI sequence 

25 The HPV 31 LI wild-type sequence has been described previously (Goldsborough et al., 

Virology 171(1): 306-31 1 (1989); Genbank Accession # J04353). It is not uncommon, however, to find 
minor sequence variations between DNAs obtained from clinical isolates. To isolate a representative 
HPV31 LI wild-type sequence, DNA was isolated from three clinical samples previously shown to 
contain HPV 31 DNA. HPV 31 LI sequences were amplified in a polymerase chain reaction (PCR) using 

30 Taq DNA polymerase and the following primers: HPV 31 LI F 5 * - CGT CGA CGT AAA CGT GTA 
TCA TAT TTT TTT ACA G - 3' (SEQ ID NO:5) and HPV 31 LIB 5' - CAG ACA CAT GTA TTA 
CAT ACA CAA C - 3' (SEQ ID NO:6). The amplified products were electrophoresed on agarose gels 

- 13- 



BNSDOCID: <WO 20O4O84831A2_l_> 



WO 2004/084831 



PCT/US2004/008677 



and visualized by ethidium bromide staining. The ~ 1500 bp LI bands were excised and DNA purified 
using the QIA quick PCR purification kit (Qiagen, Hilden, Germany). The DNA was then ligated to the 
TA cloning vector, pCR-II (Invitrogen Corp., Carlsbad, CA), E. coli transformed, and plated on LB agar 
with ampicillin plus IPTG and X-gal for blue/white colony selection. The plates were inverted and 
incubated for 16 hours at 37°C. White colonies were cultured in LB medium with ampicillin, shaking at 
37°C for 16 hours, and minipreps were performed to extract the plasmid DNA. 

To demonstrate the presence of the LI gene in the plasmid, restriction endonuclease 
digestions were conducted and viewed by agarose gel electrophoresis and ethidium bromide staining. 
DNA sequencing was performed on plasmids containing cloned LI from each of the tfiree clinical 
isolates. DNA and translated amino acid sequences were compared with one another and the Genbank 
HPV 31 LI sequences. Sequence analysis of the three clinical isolates revealed that no sequence was 
identical to the Genbank sequence. The pCR-II-HPV 31L1/81 clone was chosen to be the representative 
31L1 sequence and is referred to herein as the "31 LI wild-type sequence" (SEQ ID NO:l, see FIGURE 
1). The sequence chosen as 31 LI wild-type contained one silent substitution at nucleotide 1266 and a 
change from a C to a G at nucleotide 1295, altering the encoded amino acid from threonine to serine. The 
31 LI partial and total rebuilt genes (SEQ ID NOs: 2 and 3, respectively) also encode a serine at this 
location (see FIGURE 1). In all cases, the amino acid sequences are identical. Nucleotides were changed 
in the rebuilt constructs to encode amino acids using yeast-preferred codon sequences and to eliminate 
potential transcription termination signals (see EXAMPLE 2). 

The 31 LI wild-type sequence was amplified using the LS-101 5' - CTC AGA TCT CAC 
AAA ACA AAA TGT CTC TGT GGC GGC CTA GC - 3' (SEQ ID NO:7) and LS-102 5' - GAC AGA 
TCT TAC TTT TTA GTT TTT TTA CGT TTT GCT GG - 3' (SEQ ID NO:8) primers to add Bglll 
extensions. PCR was performed using Vent™ DNA polymerase. The PCR product was visualized by 
ethidium bromide staining of an agarose gel. The ~ 1500 bp band was excised and DNA purified using 
the QIAEX II gel extraction kit (Qiagen). The PCR product was then digested with Bglll at 37 °C for 2 
hours and purified using the QIA quick PCR purification kit. The BglQ digested 3 1 LI PCR product was 
ligated to BamHl digested pGALl 10 and DH5 E. coli were transformed. Colonies were screened by PCR 
for the HPV 31 LI insert in the correct orientation. Sequence and orientation were confirmed by DNA 
sequencing. The selected clone was named pGALl 10-HPV 31L1 #2. 

Maxiprep DNA was then prepared and Saccharomyces cerevisiae were made competent 
and transformed. The yeast transformation was plated in Leu' sorbitol top-agar on Leu sorbitol plates 
and incubated inverted for 3-5 days at 30°C, Colonies were picked and streaked for isolation on Leu 
sorbitol plates. To induce LI transcription and protein expression, isolated colonies were subsequently 
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grown in 5 ml of 5 X Leu Ade sorbitol with 1.6% glucose and 4% galactose in rotating tube cultures at 
30°C. 

EXAMPLE 2 

5 Yeast codon optimization 

Yeast-preferred codons have been described (Sharp and Cowe, Yeast 7: 657-678 (1991)). 
Initially, the middle portion of HPV 31 LI, representing nucleotides 697-1249, was rebuilt utilizing yeast- 
preferred codons. The strategy employed to rebuild was to design long overlapping sense and antisense 
oligomers that span the region to be rebuilt, substituting nucleotides with yeast-preferred codon sequences 

1 0 while maintaining the same amino acid sequence. These oligomers were used in place of template DNA 
in the PCR reaction. Additional amplification primers were designed and used to amplify the rebuilt 
sequences from template oligomers with Pfu DNA polymerase (Stratagene, La Jolla, CA). The optimal 
conditions for amplification were section-specific; however, most employed a program resembling the 
following: an initial denaturation step of 94°C for 1 minute, followed by 15-25 cycles of 95°C for 30 sec 

15 denature, 55°C for 30 sec anneal, 72°C for 3.5 minutes extension, followed by a 72°C for 10 minute final 
extension and 4°C hold. 

PCR products were examined by agarose gel electrophoresis. Bands of the appropriate 
size were excised and the DNA was gel purified. The amplified fragments were then used as template to 
assemble the 552 nucleotide rebuilt HPV 31 middle LI fragment. PCR was then used to amplify the 

20 wild-type nucleotides 1-725 (5'end) and 1221-1515 (3'end). A final PCR using the 5'end, the 3'end, and 
the rebuilt middle was performed to generate full-length 3 1 LI partial rebuild, referred to herein as the 
"31 LI partial rebuild". 

The complete 3 1 LI sequence was also rebuilt with yeast-preferred codons. This 
construct is referred to herein as the "31 LI total rebuild". Nine long overlapping oligomers were used to 

25 generate yeast-preferred codon nucleotide sequences from 1-753 and four long overlapping oligomers 
were used to generate yeast-preferred codon nucleotide sequences from 1207-1515. After amplification 
and gel purification, these fragments, along with the middle rebuilt section described above (nucleotides 
697-1249), were used together in a PCR reaction to generate the full length 31 LI total rebuild sequence. 
This piece was generated with BamRI extensions. The gel purified rebuilt 3 1L1 DNA was digested with 

30 BamHLi ligated to BamYR digested pGALl 10 expression vector and transformed into E. coli DH5 cells. 
Colonies were screened by PCR for the HPV 3 1 LI insert in the correct orientation. Sequence and 
orientation were confirmed by DNA sequencing. 
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Plasmid DNA was prepared. S. cerevisiae cells were made competent and transformed. 
The yeast were plated in Leu" sorbitol top-agar on Leu sorbitol plates and incubated inverted for 3-5 
days. Colonies were streaked for isolation on Leu- sorbitol plates. Isolated colonies were subsequently 
5 grown in 5 ml of 5 X Leu- Ade- sorbitol with 1.6% glucose and 4% galactose in rotating tube cultures at 
30°C to induce LI transcription and protein expression. After 48-72 hours, culture volume equivalent to 
an OD600 = 10 was pelleted, supernate removed and the pellets frozen and stored -70°C. 

EXAMPLE 3 

10 RNA preparation 

Cell pellets of transformed yeast, which were induced to express HPV 31 LI by galactose 
induction, were thawed on ice and suspended in 1 ml of cold DEPC-treated water. Cells were pelleted by 
centrifugation and the resulting supernatant was removed. The cell pellet was then resuspended in 400 \i\ 
TES (10 mM Tris pH7.0, 10 mM EDTA and 0.5% SDS). An equal volume of AE buffer-saturated 

15 phenol (50 mM NaOAc and 10 mM EDTA) was added. The tube was vortexed for 10 seconds and 
heated to 65°C for 50 minutes with mixing every 10 minutes. The tube was then placed on ice for 5 
minutes, followed by centrifugation at 4°C for 5 minutes. The supernatant was collected and transferred 
to a sterile tube. An additional 400 jal of phenol was added, the tube vortexed, placed on ice for 5 
minutes and centrifuged. The supernatant was transferred to a sterile tube and 400 \x\ of chloroform 

20 added, mixed and centrifuged. The supernatant was again collected and transferred to a sterile tube and 
40 jil 3 M Na Acetate pH 5.2 added in addition to 1 ml 100% EtOH. The tube was placed on dry ice for 
one hour, after which it was centrifuged at high speed to pellet the RNA. The RNA was washed one time 
with 70% EtOH and air-dried. The RNA was then suspended in 100 jal DEPC-treated water and heated to 
65°C for 5 minutes to dissolve. Spectrophotometry was performed to determine the concentration of 

25 RNA in the sample using the assumption that an A260 reading of 1 = 40 [ig/ml RNA when the A260/280 
is 1.7-2.0. 



-16- 



BNSDOCID: <WO 200408483 1A2_I_> 



WO 2004/084831 PCTAJS2004/008677 



EXAMPLE 4 

Northern blot analysis 

Initial analysis of yeast expressing 31 LI wild-type suggested that the expression yield of 
HPV 31 LI protein was considerably less than was expected. To determine if the low expression was 
5 occurring due to a problem at the transcription level versus the translation level, Northern blot analysis of 
the HPV 31 LI transcript was performed. Northern blots were made from gels in which RNA from yeast 
expressing HPV 16 LI was run with RNA from yeast expressing HPV3 1 LI on the same gel to compare 
transcript sizes. 

A 1.2% agarose formaldehyde gel was cast. Ten micrograms of RNA was combined 

10 with denaturing buffer (final concentrations: 6% formaldehyde, 45% formamide and 0.9 x MOPS) and 
heated to 55°C for 15 minutes. A one-tenth volume of gel loading buffer was added and the sample 
loaded onto the gel. Electrophoresis was performed at 65 volts in 1 x MOPS buffer for ~ 5 hours. The 
gel was washed for 15 minutes in sterile water followed by two five minute washes in 10 x SSC. The 
RNA was transferred to a Hybond-N+ nylon membrane (Amersham Biosciences, Piscataway, NJ) by 

15 capillary action over 16 hours in 10 x SSC. The RNA was then fixed to the nylon membrane by cross- 
linking using the Amersham cross-linker set for 700 units of energy. After fixing, the nylon membrane 
was allowed to air dry. The membrane was placed in 30 ml Zetaprobe buffer at 55°C for 2 hours after 
which 32P-labeled probes were added and incubated for 16 hours at 53-65°C. The membrane was then 
washed 3 times in 5 X SSC at room temperature for 20 minutes, followed by 2 times in 0.4 x SSC for 20 

20 minutes at room temperature and once at 60°C for 10 minutes. Probe DNA was generated by PCR using 
HPV 31 LI sequence specific sense and antisense primers. The amplified DNA was labeled by treatment 
with polynucleotide kinase (PNK) and y- 32P ATP at 37°C for 1 hour. The blot was wrapped in saran 
wrap and exposed to x-ray film for 16 hours. Upon film development, probe-hybridized RNA was 
detected as a black band on the autoradiograph. 

25 Analysis of the Northern blot described above revealed that the majority of the full- 

length HPV 31 LI wild-type transcripts were considerably smaller than full length (see FIGURE 4). 
However, the 31 LI partial rebuild was designed not only to insert yeast-preferred codons in the middle 
of the gene, but also to eliminate any potential sequences resembling yeast transcription termination sites. 
Northern blot analysis clearly showed that upon rebuilding, the length of the 31 LI gene transcript had 

30 significantly increased to a size corresponding with that of the full-length HPV 16 LI transcript (not 

shown). Thus, premature transcription termination is likely to have accounted for a significant portion of 
the low expression yield from the 31 LI wild-type construct. 
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EXAMPLE 5 

HPV 31 LI protein expression 

Frozen yeast cell pellets of galactose induced cultures equivalent to OD600= 10 were 
5 thawed on ice and suspended in 300 jil of PC buffer (100 mM Na2HP04 and 0.5 M NaCl, pH 7.0) with 
2mM PMSF. Acid- washed 0.5mm glass beads were added, ~ 0.5g/tube. The tubes were vortexed for 15 
minutes at 4°C. 7.5 \xl of 20% TritonXlOO was added and vortex repeated for 5 minutes at 4°C. The 
tubes were placed on ice for 15 minutes, then centrifoged for 15 minutes at 4°C. The supernate was 
transferred to a sterile microcentrifuge tube and stored at -70°C. 

10 

EXAMPLE 6 

Western blot analysis 

Total yeast protein extract from twenty to forty isolated yeast colonies for each HPV 31 
LI construct were analyzed by Western blot to confirm expression of HPV 3 1 LI protein after galactose 
15 induction. 

Ten micrograms of total yeast protein extract was combined with SDS-PAGE loading 
buffer and heated to 95°C for 10 minutes. The proteins were loaded onto an 8% SDS-PAGE gel and 
electrophoresed in Tris-Glycine buffer. After protein separation, the proteins were Western transferred 
from the gel to nitrocellulose and the blot was blocked in 10% non-fat dry milk in TTBS (Tris buffered 

20 saline with Tween -20) for 16 hours. The blot was washed three times in TTBS. Goat anti-trpE-HPV 16 
LI serum, a polyclonal serum that cross-reacts with HPV 31 LI, was applied at a 1 : 1000 dilution in 
TTBS for 1 lir at room temperature. The blot was washed three times in TTBS and anti-goat-HRP 
conjugated antibody .was applied at a 1 :2500 dilution in TTBS for 1 hr. The blot was again washed three 
times and ECL™ detection reagent was applied (Amersham Biosciences, Piscataway, NJ). 

25 Autoradiography was then performed. Proteins recognized by the antiserum were visualized by the 
detection reagent as dark bands on the autoradiograph. 

In all cases, the HPV 3 1 LI protein was detected as a distinct band on the autoradiograph 
corresponding to approximately 55 kD (data not shown). The HPV 16 LI protein was included as a 
positive control on the gels. 

30 

-18- 

BNSOOCID: <WO 2004084831 A2J_> 



WO 2004/084831 



( 

PCT/US2004/008677 



EXAMPLE 7 

Radioimmunoassay fRIA) 

The yeast cells expressing HPV 3 1 LI were grown by a variety of methods, including 
rotating tube cultures, shake flasks and fermenters. The yeast were lysed and protein extracts made to 
5 determine the amount of HPV 31 LI virus-like particles (VLPs) produced per milligram of total protein. 
To demonstrate HPV 3 1 LI VLP expression, a portion of each total yeast protein extract was analyzed by 
capture radioimmunoassay (RIA). 

The RIA was performed using a detection monoclonal antibody, H31.A6, that is HPV 
type 3 1 -specific and VLP conformational-specific. H3 1 .A6 is specific for HPV type 3 1 LI as it is found 

1 0 to bind intact HPV 3 1 LI VLPs and does not recognize denatured HPV 3 1 VLPs. This mAb can be 

subsequently detected by a goat anti-mouse antibody radiolabeled with 1125. Therefore, the counts per 
minute (cpm) values correspond to relative levels of HPV31 LI VLP expression. 

Polystyrene beads were coated with a goat anti-trpE-HP V3 1 LI polyclonal serum diluted 
1:1000 in PBS overnight. The beads were then washed with 5 volumes of sterile distilled water and air- 

15 dried. The antigen, total yeast protein extract from isolated yeast colonies, was then loaded onto the 

beads by dilution in PBS with 1% BSA, 0.1% Tween-20 and 0.1% Na Azide and incubated with rotation 
for one hour. After washing, the beads were distributed one per well in a 20-well polystyrene plate and 
incubated with H3 1 .A6 mAb diluted 1 :50,000 for 17-24 hours at room temperature. The beads were 
washed and.I125 labeled goat anti-mouse IgG was added at an activity range of 23000-27000 cpm per 10 

20 [xl. After 2 hours, the beads were washed and radioactive counts were recorded in cpm/ml. Background 
counts from blank wells were subtracted from the total cpm/ml, giving the RIA minus background value. 

Two experiments were performed: in experiment 1, protein extracts from 31 LI wild-type 
and 3 1 LI partial rebuild were compared and in experiment 2, protein extracts from 31 LI partial rebuild 
and 3 1 LI total rebuild were compared (see FIGURE 5). Results indicate that 3 1 LI partial rebuild VLP 

25 expression is 6.9 fold greater than 31 LI wild-type. The 31 LI total rebuild has a 1.7 fold increased 

expression over the 31 LI partial rebuild. Therefore, the 31 LI expression levels were increased > 7 fold 
by introducing yeast-preferred codon sequences and eliminating potential transcription termination 
signals. 
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EXAMPLES 

Transmission electron microscopy 

To demonstrate that the HPV 3 1 LI protein was in fact self-assembling to form 
pentarneric-Ll capsomers, which in turn self-assemble into virus-like particles, a partially purified 31 LI 
5 total rebuild protein extract was subject to transmission electron microscopy (TEM). Yeast were grown 
under small scale fermentation and pelleted. The pellets were subjected to purification treatments. Pellet 
and clarified yeast extracts were analyzed by immunoblot to demonstrate LI protein expression and 
retention through the purification procedure. Clarified yeast extracts were then subjected to 
centrifiigation over a 45%-sucrose cushion and the resulting pellet suspended in buffer for TEM analysis 
1 0 (see FIGURE 6). Results indicated that the diameter of the spherical particles in this crude sample ranged 
from between 30 and 60 nrn with some particles displaying a regular array of capsomers. 
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WHAT IS CLAIMED IS: 

1 . A nucleic acid molecule comprising a sequence of nucleotides that encodes an 
HPV3 1 LI protein as set forth in SEQ ID NO:4 3 the nucleic acid sequence being codon-optimized for 

5 high level expression in a yeast cell. 

2. A vector comprising the nucleic acid molecule of claim 1 . 

3. A host cell comprising the vector of claim 3 . 

10 

4. The host cell of claim 3, wherein the host cell is selected from the group 
consisting of: Saccharomyces cerevisiae, Hansenula polymorphs, Pichia pastoris, Kluyvermyces fragilis, 
Kluyveromyces lactis, and Schizosaccharomyces pombe. 

15 5. The host cell of claim 4, wherein the host cell is Saccharomyces cerevisiae, 

6. The nucleic acid molecule of claim 1, wherein the sequence of nucleotides 
comprises a sequence of nucleotides as set forth in SEQ ID NO:2. 

20 7. A vector comprising the nucleic acid molecule of claim 6. 

8. A host cell comprising the vector of claim 7. 

9. The nucleic acid molecule of claim 1, wherein the sequence of nucleotides 
25 comprises a sequence of nucleotides as set forth in SEQ ID NO:3. 

10. A vector comprising the nucleic acid molecule of claim 9. 

11. A host cell comprising the vector of claim 1 0. 

JO 

12. Virus-like particles (VLPs) comprised of recombinant LI protein or recombinant 
LI + L2 proteins of HPV31. 
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13. The VLPs of Claim 12 wherein the recombinant LI protein or the recombinant 
LI + L2 proteins are produced in yeast. 

14. The VLPs of claim 13, wherein the recombinant LI protein or recombinant LI + 
5 L2 proteins are encoded by a codon-optimized HPV3 1 LI nucleic acid molecule. 

15. The VLPs of claim 14, wherein the codon-optimized nucleic acid molecule 
consists essentially of a sequence of nucleotides as set forth in SEQ ID NO:2 or SEQ ID NO:3. 



10 1 6. A method of producing the VLPs of Claim 1 4, comprising: 

(a) transforming yeast with a codon-optimized DNA molecule 
encoding HPV3 1 LI protein or HPV3 1 LI + L2 proteins; 

(b) cultivating the transformed yeast under conditions that permit 
expression of the codon-optimized DNA molecule to produce a 

1 5 recombinant papillomavirus protein; and 

(c) isolating the recombinant papillomavirus protein to produce the 
VLPs of Claim 14. 



20 



17. A vaccine comprising the VLPs of Claim 14. 

18. Pharmaceutical compositions comprising the VLPs of claim 14. 



19. A method of preventing HPV infection comprising administering the vaccine of 
Claim 1 7 to a mammal. 

25 

20. A method for inducing an immune response in an animal comprising 
administering the VLPs of Claim 14 to an animal. 



21 . The virus-like particles of Claim 14 wherein the yeast is selected from the group 
30 consisting of Saccharomyces cerevisiae, Hansemda polymorpha, Pichia pastoris, Kluyvermyces fragilis, 
Kluyveromyces lactis, and Schizosaccharomyces pombe. 



22. The virus-like particles of claim 21, wherein the yeast is Saccharomyces cerevisiae. 
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23. The vaccine of claim 17, further comprising VLPs of at least one additional HPV 

type. 

24. The vaccine of claim 23, wherein the at least one additional HPV type is selected 
5 from the group consisting of: HPV6, HPV11, HPV 16, HPV 18, HPV33, HPV35, HPV39, HPV45, 

HPV51, HPV52, HPV55, HPV56, HPV58, HPV59, and HPV68. 

25. The vaccine of claim 24, wherein the at least one HPV type comprises HPV16. 
10 26. The vaccine of claim 25, further comprising HPV18 VLPs. 

27. The vaccine of claim 26, further comprising HPV6 VLPs and HPV1 1 VLPs. 

28. A nucleic acid molecule comprising a sequence of nucleotides that encodes an 
15 HPV31 LI protein, the nucleic acid molecule free from transcription termination signals that are 

recognized by yeast. 

29. A vector comprising the nucleic acid molecule of claim 28. 

20 30. A host cell comprising the vector of claim 29. 

3 1 . The host cell of claim 30, wherein the host cell is selected from the group 
consisting of: Saccharomyces cerevisiae, Hansenula polymorphs Pichia pastoris, Kluyvermyces fi-agilis, 
Kluyveromyces lactis, and Schizosaccharomyces pombe. 



25 



32. The host cell of claim 3 1 , wherein the host cell is Saccharomyces cerevisiae. 



33. The VLPs of claim 13, wherein the recombinant LI protein or recombinant LI + 
L2 proteins are encoded by a HPV31 LI nucleic acid molecule that is free from transcription termination 

30 signals that are recognized by yeast. 

34. A method of producing the VLPs of Claim 33, comprising: 
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(a) transforming yeast with a DNA molecule encoding HPV3 1 LI 
protein or HPV3 1 LI + L2 proteins, the DNA molecule free from 
transcription termination sequences that are recognized by yeast; 

(b) cultivating the transformed yeast under conditions that permit 
5 expression of the DNA molecule to produce a recombinant 

papillomavirus protein; and 

(c) isolating the recombinant papillomavirus protein to produce the 
VLPs of Claim 33. 

10 35. A vaccine comprising the VLPs of Claim 33. 

36. Pharmaceutical compositions comprising the VLPs of claim 33. 

37. A method of preventing HPV infection comprising administering the vaccine of 
1 5 Claim 35 to a mammal. 

38. A method for inducing an immune response in an animal comprising 
administering the VLPs of Claim 33 to the animal. 

20 39. The vaccine of claim 35, further comprising VLPs of at least one additional HPV 

type. 

40. The vaccine of claim 39, wherein the at least one additional HPV type is selected 
from the group consisting of: HPV6, HPVi 1, HPV 16, HPV18, HPV33, HPV35, HPV39, HPV45, 

25 HPV51,HPV52,HPV55,HPV56,HPV58,HPV59,andHPV68. . 

41 . The vaccine of claim 40, wherein the at least one HPV type comprises HPV 16. 

42 . The vaccine of claim 4 1 , further comprising HPV 1 8 VLPs . 

30 

43. The vaccine of claim 42, further comprising HPV6 VLPs and HPVI 1 VLPs. 
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HPV 31 LI nucleotide sequence alignment. 

31 LI wt ( 1) ATGTCTCTGTGGC66CCTA6CGAGGCTACTGTCTACTTACCACCTGTCCC 

31 LI partial ( 1) 

31 LI total ( 1) T A.A..ATCT..A C G A 

31 LI wt ( 51) AGTGTCTAAAGTTGTAAGCACGGATGAATATGTAACACGAACCAACATAT 

31 LI partial ( 51) 

31 LI total ( 51) ...C G..C..CTCT..C..C C..C..CA C. 

31 LI wt ( 101) ATTATCACGCAGGCAGTGCTAGGCTGCTTACAGTAGGCCATCCATATTAT 

31 LI partial ( 101) 

31 LI total (101) .C..C T..TTC AT. .T.G. .C. .C. .T. X C..C 

31 LI wt ( 151) TCCATACCTAAATCTGACMTCCTAAAAAAATAGTTGTACCAAAGGTGTC 

31 LI partial ( 151) 

31 LI total ( 151) . .T. .C. .A. .G C. .A. .G. .G. X. .C. .C C. . 

31 LI wt ( 201) AGGA1TACMTATAGGGTATTTAGGGTTCGTTTACCAGATCCAAACAAAT 

31 LI partial ( 201) 

31 LI total ( 201) T. .T. .6 C. .A. X. X. .A. XA.A. .G C G. 

31 LI wt ( 251) nGGATTTCCTGATACATCTTTTTATAATCCTGAAACTCMCGCTTAGTT 

31 LI partial ( 251) 

31 LI total ( 251) X. .T. X. .A. X. X C. X. X. .A C. . .A. A. .G. X 

31 LI wt ( 301) TGGGCCTGTGTTGGTTTAGAGGTAGGTCGCGGGCAGCCATTAGGTGTAGG 

31 LI partial ( 301) 

31 LI total ( 301) T C G. .A. X. . .A. A. .T. .A G C. . 

31 LI wt ( 351) TATTAGTGGTCATCCATTATTAAATAAATTTGATGACACTGAAAACTCTA 

31 LI partial ( 351) 

31 LI total ( 351) . . XTC C G..G. X. .G. X. X C 

31 LI wt ( 401) ATAGATATGCCGGTGGTCCTGGCACTGATAATAGGGAATGTATATCAATG 

31 LI partial ( 401) 

31 LI total (401) X C..T A. .T. X. X. X. .A C..T... 

FIG.1A 
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31 LI wt ( 451) GATTATAAACAAACACAACTGTGTTTACTT6GTT6CAAACCACCTATT6G 

31 LI partial ( 451) 

31 LI total (451) ..C..C..G C...T GT.G T..G A..C. 

31 LI wt ( 501) AGAGCATTGGGGTAAAGGTAGTCCTTGTAGTAACAATGCTATTACCCCTG 

31 LI partial ( 501) 

31 LI total ( 501) T..A..C G. . .TC. . .A. . .TC C C A. 

31 LI wt ( 551) GTGATTGTCCTCCATTAGAATTAAAAAATTCAGTTATACAAGATGGGGAT 

31 LI partial ( 551) 

31 LI total ( 551) . . . .C A G G. .G. .C. .T. .C. .C C. .T. .C 

31 LI wt ( 601) ATGGTTGATACAGGCTTTGGAGCTATGGATT1TACTGCTTTACAAGACAC 

31 LI partial ( 601) 

31 LI total (601) C..C..C..T..C..T C..C..C G 

31 LI wt ( 651) TAAMGTMTGTTCCT1TGGACATTTGTMTTCTATTTGTAAATATCCAG 

31 LI partial ( 651) 

31 LI total ( 651) C. .GTC. . .C. .C. .A C C C G. .C. . . . 

31 LI wt ( 701) ATTATCTTAAAATGGTTGCTGAGCCATATGGCGATACATTA 1 1 1 1 1 1 1 AT 

31 LI partial (701) C C..C. .G. .C. .C. .C 

31 LI total (701) .C..CT.G..G C A C C..C. .G. .C. .C. .C 

31 LI wt ( 751) nACGTAGGGMCAMTGTTTGTMGGCATTTTTTTAATAGATCAGGCAC 

31 LI partial ( 751) ..G A G C C..C..C..C C 

31 LI total ( 751) . .G A G C CC..C..C C 

31 LI wt ( 801) GGTTGGTGAATCGGTCCCTACTGACTTATATATTAAAGGCTCCGGTTCAA 

31 LI partial (801) C..A T A. .C. . .C.G. .C. .C. .G C. 

31 LI total (801) C..A T A. .C. . .C.G. .C. .C. .G C. 

31 LI wt ( 851) CAGCTACTTTAGCTAACAGTACATACTTTCCTACACCTAGCGGCTCCATG 

31 LI partial (851) .C CC.G TCC..C C..A. .T..ATCT 

31 LI total ( 851) .C CC.G TCC. .C C. .A. .T..ATCT 

31 LI wt ( 901) GTTACTTCAGATGCACAMTTTTTAATAAACCATATTGGATGCAACGTGC 

31 LI partial (901) . .C..C. .C. .C..T..G. .C. .C..C. .G C G 

31 LI total (901) ..C..C..C..C..T..G..C..C..C..G C G 



FIG.1B 
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31 LI wt ( 951 

31 LI partial ( 951 

31 LI total ( 951 

31 LI wt (1001 

31 LI partial (1001 

31 LI total (1001 

31 LI wt (1051 

31 LI partial (1051 

31 LI total (1051 

31 LI wt (1101 

31 LI partial (1101 

31 LI total (1101 

31 LI wt (1151 

31 LI partial (1151 

31 LI total (1151 

31 LI wt (1201 

31 LI partial (1201 

31 LI total (1201 

31 LI wt (1251 

31 LI partial (1251 

31 LI total (1251 

31 LI wt (1301 

31 LI partial (1301 

31 LI total (1301 

31 LI wt (1351 

31 LI partial (1351 

31 LI total (1351 

31 LI wt (1401 

31 LI partial (1401 

31 LI total (1401 
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TCAGGGACACMTMTGGTATnGTTGGGGCAATCAGTTATTTGTTACTG 

A T C..C C T..C...C.G..C..G. ... 

A T C..C C T..C...C.G..C..G. ... 

TGGTAGATACCACACGTAGTACCAATATGTCTGTTTGTGCTGCAATTGCA 

... .C G...TC C C C..T 

....C G...TC C C C..T 

MCAGTGATACTACAmAAMGTAGTMTTrTAAAGAGTATTTAAGACA 

. . .TC. . .C C. .C. .GTCCTC. . .C. .C. .G CC.G 

. . .TC. . .C C. .C. .GTCCTC. . .C. .C. .G CC.G 

TGGTGAGGMTnGATnACMTTrATATTTCAGTTATGCAAAATAACAT 

C...C.G C..C..C G G. .C. .CC 

C...C.G C..C..C G G..C..CC 



TATCTGCAGACATAATGACATATATTCACAGTATGAATCCTGCTATTTTG 

.G T C C..C..C C C..CC. 

.G T C C..C..C. : C C..CC. 



GMGATTGGMTTTTGGATTGACCACACCTCCCTCAGGTTCTTTGGAGGA 

..G..C C..C..TC. T..A..T..C 

..G..C C..C..TC T..A..T..C A.. 

TACCTATAGGTTTGTAACCTCACAGGCCATTACATGTCAAAAAAGTGCCC 

C C..A..C..C T..A..T..C..C GTC...T. 

CCCAAAAGCCCMGGMGATCCATTTAMGAnATGTATTTTGGGAGGTT 



.A. 



.C..G..C..C..C..C. 



.A..C 



MTTTAAMGAAMGTTnCTGCAGATTTAGATCAGTTTCCACTGGGTCG 



.C..G..G C T..C..G..C..A..C...T. 



CAM1TTTTATTACAGGCAGGATATAGGGCACGTCCTAMTTTAAAGCAG 



A..G..C..G..G..A..T..T..C..A..TA.A..A..G..C..G..T. 



FIG.1C 
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31 LI wt (1451) GTAAACGTAGT6CACCCTCA6CATCTACCACTACACCAGCAAAACGTAAA 

31 LI partial (1451) 

31 LI total (1451) GA.ATC. . .T. .A. .T. .T C..C T..GA.A..G 

31 LI wt (1501) AAAACTAAAAAGTAA (SEQ ID N0:1) 

31 LI partial (1501) (SEQ ID NO: 2) 

31 LI total (1501) (SEQ ID N0:3) 



FIG.1D 
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HPV31 LI total rebuild nucleotide and amino acid sequences. 

MSLW RPS EAT VYLP PVP 
1 ATGTCTTTGT 6GAGACCATC T6AAGCTACC GTCTACTTGC CACCAGTCCC 

VSK V V. ST DEY VTR TNIY 
51 AGTCTCTAAG GTCGTCTCTA CCGACGAATA CGTCACCAGA ACCAACATCT 

Y H A GSA RLLT VGH PYY 
101 ACTACCACGC TGGTTCTGCT AGATTGTTGA CCGTCGGTCA CCCATACTAC 

SIPK SDN PKK IVVP KVS 
151 TCTATCCCAA AGTCTGACAA CCCAAAGAAG ATCGTCGTCC CAAAGGTCTC 

GLQ YRVF RVR LPD PNKF 
201 TGGTTTGCAA TACAGAGTCT TCAGAGTCAG ATTGCCAGAC CCAAACAAGT 

GFP DTS FYNP ETQ RLV 
251 TCGGTTTCCC AGACACCTCT TTCTACAACC CAGAAACCCA AAGATTGGTC 

WACV GLE VGR GQPL GVG 
301 TGGGCTTGTG TCGGTTTGGA AGTCGGTAGA GGTCAACCAT TGGGTGTCGG 

ISG HPLL NKF DDT ENSN 
351 TATCTCTGGT CACCCATTGT TGAACAAGTT CGACGACACC GAAAACTCTA 

RYA GGP GTDN REC ISM 
401 ACAGATACGC TGGTGGTCCA GGTACCGACA ACAGAGAATG TATCTCTATG 

DYKQ TQL CLL GCKP PIG 
451 GACTACAAGC AAACCCAATT GTGTTTGTTG GGTTGTAAGC CACCAATCGG 

EHW GKGS PCS NNA ITPG 
501 TGAACACTGG GGTAAGGGTT CTCCATGTTC TAACAACGCT ATCACCCCAG 

DCP PLELKNSVIQDGD 
551 GTGACTGTCC ACCATTGGAA TTGAAGAACT CTGTCATCCA AGACGGTGAC 

FIG. 2A 
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MVDT GFG AMD F T A- L QDT' 
601 ATGGTCGACA CCGGTTTCGG TGCTATGGAC TTCACCGCTT TGCAAGACAC 

KSN VPLD ICN SIC KYPD 
651 CAAGTCTAAC GTCCCATTGG ACATCTGTAA CTCTATCTGT AAGTACCCAG 

YLK M V A EPYG DTL FFY 
701 ACTACTTGAA GATGGTCGCT GAACCATACG GCGACACCTT GTTCTTCTAC 

LRRE QMF VRH FFNR S"T 
751 TTGCGTAGAG AACAGATGTT CGTAAGGCAC TTCTTCAACA GATCCu-XAC 

VGE SVPT DLY I K G SGST 
801 CGTAGGTGAA TCTGTCCCAA CCGACCTGTA CATCAAGGGC TCCGGTTCCA 

ATL ANS TYFP TPS GSM 
851 CCGCTACCCT GGCTAACTCC ACCTACTTCC CAACTCCATC TGGCTCCATG 

VTSD A Q I FNK PY WM QRA 
901 GTCACCTCCG ACGCTCAGAT CTTCAACAAG CCATACTGGA TGCAGCGTGC 

QGH NNGI CWG NQL FVTV 
951 ACAGGGTCAC AACAACGGTA TCTGTTGGGG TAACCAGCTG TTCGTGACTG 

VDT TRS TNMS VCA A I A 
1001 TGGTCGATAC CACGCGTTCT ACCAACATGT CTGTCTGTGC TGCAATCGCT 

NSDT TFK SSN FKEY LRH 
1051 AACTCTGACA CTACCTTCAA GTCCTCTAAC TTCAAGGAGT ACCTGAGACA 

GEE FDLQ FIF QLC KITL 
1101 TGGTGAGGAA TTCGATCTGC AATTCATCTT CCAGTTGTGC AAGATCACCC 

SAD IMT YIHS MNP AIL 
1151 TGTCTGCTGA CATCATGACC TACATCCACA GTATGAACCC TGCCATCCTG 

EDWN FGL TTP PSGS LED 
1201 GAGGACTGGA ACTTCGGTCT GACCACTCCA CCTTCCGGTT CTTTGGAAGA 

FIG.2B 
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TYR FVTS Q A I TCQ KSAP 
1251 CACCTACAGA TTCGTCACCT CTCAAGCTAT CACCTGTCAA AAGTCTGCTC 

QKP KED PFKD YVF WEV 
1301 CACAAAAGCC AAAGGAAGAC CCATTCAAGG ACTACGTCTT CTGGGAAGTC 

NLKE KFS ADL DQFP LGR 
1351 AACTTGAAGG AAAAGTTCTC TGCTGACTTG GACCMTTCC CATTGGGTAG 

KFL LQAG YRA RPK FKAG 
1401 AAAGTTCTTG TTGCAAGCTG GTTACAGAGC TAGACCAAAG TTCAAGGCTG 

KRS APS ASTT TPA KRK 
1451 GTAAGAGATC TGCTCCATCT GCTTCTACCA CCACCCCAGC TAAGAGAAAG 

K T K K * (SEQ ID N0:4) 
1501 AAGACCAAGA AGTAA (SEQ ID NO: 3) 

FIG.2C 
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Northern Blot Analysis 
31 wt 31 wt 16 Neg 31 R 31 R 



Full Length 
Truncated 




FIG.4 
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SEQUENCE LISTING 



<110> Merck & Co., Inc. 

Jansen, Kathrin U. 
Schultz, Loren D. 
Neeper, Michael P. 
Markus, Henry Z. 

<120> OPTIMIZED EXPRESSION OF HPV 31 LI IN 
YEAST 

<130> 21188-PCT 

<150> 60/457,172 
<151> 2003-03-24 

<160> 8 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 1515 
<212> DNA 

<213> HPV31 LI wild-type 



<400> 1 

atgtctctgt 

gttgtaagca 

aggctgctta 

atagttgtac 

ccaaacaaat 

tgggcctgtg 

catccattat 

ggcactgata 

ggttgcaaac 

attacccctg 

atggttgata 

gttcctttgg 

gagccatatg 

ttttttaata 

tccggttcaa 

gttacttcag 

aataatggta 

accaatatgt 

tttaaagagt 

aaaataacat 

gaagattgga 

tttgtaacct 

ccatttaaag 

gatcagtttc 

tttaaagcag 

aaaactaaaa 



ggcggcctag 
cggatgaata 
cagtaggcca 
caaaggtgtc 
ttggatttcc 
ttggtttaga 
taaataaatt 
atagggaatg 
cacctattgg 
gtgattgtcc 
caggctttgg 
acatttgtaa 
gcgatacatt 
gatcaggcac 
cagctacttt 
atgcacaaat 
tttgttgggg 
ctgtttgtgc 
atttaagaca 
tatctgcaga 
attttggatt 
cacaggccat 
attatgtatt 
cactgggtcg 
gtaaacgtag 
agtaa 



cgaggctact 
tgtaacacga 
tccatattat 
aggattacaa 
tgatacatct 
ggtaggtcgc 
tgatgacact 
tatatcaatg 
agagcattgg 
tccattagaa 
agctatggat 
ttctatttgt 
atttttttat 
ggttggtgaa 
agctaacagt 
ttttaataaa 
caatcagtta 
tgcaattgca 
tggtgaggaa 
cataatgaca 
gaccacacct 
tacatgtcaa 
ttgggaggtt 
caaattttta 
tgcaccctca 



gtctacttac 
accaacatat 
tccataccta 
tatagggtat 
ttttataatc 
gggcagccat 
gaaaactcta 
gattataaac 
ggtaaaggta 
ttaaaaaatt 
tttactgctt 
aaatatccag 
ttacgtaggg 
tcggtcccta 
acatactttc 
ccatattgga 
tttgttactg 
aacagtgata 
tttgatttac 
tatattcaca 
ccctcaggtt 
aaaagtgccc 
aatttaaaag 
ttacaggcag 
gcatctacca 



cacctgtccc 
attatcacgc 
aatctgacaa 
ttagggttcg 
ctgaaactca 
taggtgtagg 
atagatatgc 
aaacacaact 
gtccttgtag 
cagttataca 
tacaagacac 
attatcttaa 
aacaaatgtt 
ctgacttata 
ctacacctag 
tgcaacgtgc 
tggtagatac 
ctacatttaa 
aatttatatt 
gtatgaatcc 
ctttggagga 
cccaaaagcc 
aaaagttttc 
gatatagggc 
ctacaccagc 



agtgtctaaa 

aggcagtgct 

tcctaaaaaa 

tttaccagat 

acgcttagtt 

tattagtggt 

cggtggtcct 

gtgtttactt 

taacaatgct 

agatggggat 

taaaagtaat 

aatggttgct 

tgtaaggcat 

tattaaaggc 

cggctccatg 

tcagggacac' 

cacacgtagt 

aagtagtaat 

tcagttatgc 

tgctattttg 

tacctatagg 

caaggaagat 

tgcagattta 

acgtcctaaa 

aaaacgtaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1515 



<210> 2 
<211> 1515 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 31 partial rebuild 
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<400> 2 

atgtctctgt ggcggcctag 
gttgtaagca cggatgaata 
aggctgctta cagtaggcca 
atagttgtac caaaggtgtc 
ccaaacaaat ttggatttcc 
tgggcctgtg ttggtttaga 
catccattat taaataaatt 
ggcactgata atagggaatg 
ggttgcaaac cacctattgg 
attacccctg gtgattgtcc 
atggttgata caggctttgg 
gttcctttgg acatttgtaa 
gagccatacg gcgacacctt 
ttcttcaaca gatccggcac 
tccggttcca ccgctaccct 
gtcacctccg acgctcagat 
aacaacggta tctgttgggg 
accaacatgt ctgtctgtgc 
ttcaaggagt acctgagaca 
aagatcaccc tgtctgctga 
gaggactgga acttcggtct 
tttgtaacct cacaggccat 
ccatttaaag attatgtatt 
gatcagtttc cactgggtcg 
tttaaagcag gtaaacgtag 
aaaactaaaa agtaa 

<210> 3 
<211> 1515 
<212> DNA 
<213> Artificial Sequence 

<220> 

<223> 31 total rebuild 
<400> 3 

atgtctttgt ggagaccatc tgaagctacc gtctacttgc caccagtccc agtctctaag 60 
gtcgtctcta ccgacgaata cgtcaccaga accaacatct actaccacgc tggttctgct 120 
agattgttga ccgtcggtca cccatactac tctatcccaa agtctgacaa cccaaagaag 180 
atcgtcgtcc caaaggtctc tggtttgcaa tacagagtct tcagagtcag attgccagac 240 
ccaaacaagt tcggtttccc agacacctct ttctacaacc cagaaaccca aagattggtc 300 
tgggcttgtg tcggtttgga agtcggtaga ggtcaaccat tgggtgtcgg tatctctggt 360 
cacccattgt tgaacaagtt cgacgacacc gaaaactcta acagatacgc tggtggtcca 420 
ggtaccgaca acagagaatg tatctctatg gactacaagc aaacccaatt gtgtttgttg 480 
ggttgtaagc caccaatcgg tgaacactgg ggtaagggtt ctccatgttc taacaacgct 540 
atcaccccag gtgactgtcc accattggaa ttgaagaact ctgtcatcca agacggtgac 600 
atggtcgaca ccggtttcgg tgctatggac ttcaccgctt tgcaagacac caagtctaac 660 
gtcccattgg acatctgtaa ctctatctgt aagtacccag actacttgaa gatggtcgct 720 
gaaccatacg gcgacacctt gttcttctac ttgcgtagag aacagatgtt cgtaaggcac 780 
ttcttcaaca gatccggcac cgtaggtgaa tctgtcccaa ccgacctgta catcaagggc 840 
tccggttcca ccgctaccct ggctaactcc acctacttcc caactccatc tggctccatg 900 
gtcacctccg acgctcagat cttcaacaag ccatactgga tgcagcgtgc acagggtcac 960 
aacaacggta tctgttgggg taaccagctg ttcgtgactg tggtcgatac cacgcgttct 1020 
accaacatgt ctgtctgtgc tgcaatcgct aactctgaca ctaccttcaa gtcctctaac 1080 
ttcaaggagt acctgagaca tggtgaggaa ttcgatctgc aattcatctt ccagttgtgc 1140 
aagatcaccc tgtctgctga catcatgacc tacatccaca gtatgaaccc tgccatcctg 1200 
gaggactgga acttcggtct gaccactcca ccttccggtt ctttggaaga cacctacaga 1260 
ttcgtcacct ctcaagctat cacctgtcaa aagtctgctc cacaaaagcc aaaggaagac 1320 
ccattcaagg actacgtctt ctgggaagtc aacttgaagg aaaagttctc tgctgacttg 1380 
gaccaattcc cattgggtag aaagttcttg ttgcaagctg gttacagagc tagaccaaag 1440 
ttcaaggctg gtaagagatc tgctccatct gcttctacca ccaccccagc taagagaaag 1500 
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cgaggctact gtctacttac 
tgtaacacga accaacatat 
tccatattat tccataccta 
aggattacaa tatagggtat 
tgatacatct ttttataatc 
ggtaggtcgc gggcagccat 
tgatgacact gaaaactcta 
tatatcaatg gattataaac 
agagcattgg ggtaaaggta 
tccattagaa ttaaaaaatt 
agctatggat tttactgctt 
ttctatttgt aaatatccag 
gttcttctat ttgcgtagag 
cgtaggtgaa tctgtcccaa 
ggctaactcc acctacttcc 
cttcaacaag ccatactgga 
taaccagctg ttcgtgactg 
tgcaatcgct aactctgaca 
tggtgaggaa ttcgatctgc 
catcatgacc tacatccaca 
gaccactcca ccttccggtt 
tacatgtcaa aaaagtgccc 
ttgggaggtt aatttaaaag 
caaattttta ttacaggcag 
tgcaccctca gcatctacca 



cacctgtccc agtgtctaaa 60 
attatcacgc aggcagtgct 120 
aatctgacaa tcctaaaaaa 180 
ttagggttcg tttaccagat 240 
ctgaaactca acgcttagtt 300 
taggtgtagg tattagtggt 360 
atagatatgc cggtggtcct 420 
aaacacaact gtgtttactt 480 
gtccttgtag . taacaatgct 540 
cagttataca agatggggat 600 
tacaagacac taaaagtaat 660 
attatcttaa aatggttgct 720 
aacagatgtt cgtaaggcac 780 
ccgacctgta catcaagggc 840 
caactccatc tggctccatg 900 
tgcagcgtgc acagggtcac 960 
tggtcgatac cacgcgttct 1020 
ctaccttcaa gtcctctaac 1080 
aattcatctt ccagttgtgc 1140 
gtatgaaccc tgccatcctg 1200 
ctttggagga tacctatagg 1260 
cccaaaagcc caaggaagat 1320 
aaaagttttc tgcagattta 1380 
gatatagggc acgtcctaaa 144 0 
ctacaccagc aaaacgtaaa 1500 

1515 
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aagaccaaga agtaa 

<210> 4 
<211> 504 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HPV 31 LI 



<400> 4 








Met 


Ser 


Leu Trp Arg 


Pro 


Ser Glu 


1 




5 






Pro 


Val 


Ser Lys Val 


Val 


Ser Thr 






20 






He 


Tyr 


Tyr His Ala Gly 


Ser Ala 






35 




40 


Tyr 


Tyr 


Ser He Pro 


Lys 


Ser Asp 




50 






55 


Lys 


Val 


Ser Gly Leu 


Gin 


Tyr Arg 


65 






70 




Pro 


Asn 


Lys Phe Gly 

Q C 


Phe 


Pro Asp 


Gin 


Arg 


Leu Val Trp Ala 


Cys Val 






100 






Pro 


Leu 


Gly Val Gly 


He 


Ser Gly 






115 




120 


Asp 


Thr 


Glu Asn Ser 


Asn 


Arg Tyr 




130 






135 


Arg 


Glu 


Cys He Ser 


Met 


Asp Tyr 


145 






150 




Gly 


Cys 


Lys Pro Pro 


He 


Gly Glu 






165 






Ser 


Asn 


Asn Ala He 


Thr 


Pro Gly 






180 






Asn 


Ser 


Val He Gin Asp 


Gly Asp 






195 




200 


Met 


Asp 


Phe Thr Ala 


Leu 


Gin Asp 




210 






215 


He 


Cys 


Asn Ser He 


Cys 


Lys Tyr 


225 






230 




Glu 


Pro 


Tyr Gly Asp Thr 


Leu Phe 






245 






Phe 


Val 


Arg His Phe 


Phe 


Asn Arg 






260 






Pro 


Thr 


Asp Leu Tyr 


He 


Lys Gly 






275 




280 


Asn 


Ser 


Thr Tyr Phe 


Pro 


Thr Pro 




290 




295 


Ala 


Gin 


He Phe Asn 


Lys 


Pro Tyr 


305 






310 




Asn 


Asn 


Gly He Cys 


Trp 


Gly Asn 






325 






Thr 


Thr 


Arg Ser Thr Asn 


Met Ser 






340 






Asp 


Thr 


Thr Phe Lys 


Ser 


Ser Asn 






355 




360 


Glu 


Glu 


Phe Asp Leu Gin 


Phe He 




370 






375 


Ser 


Ala 


Asp He Met 


Thr 


Tyr He 


385 






390 




Glu 


Asp 


Trp Asn Phe Gly 


Leu Thr 



1515 



Ala Thr Val Tyr 


Leu 


Pro 


Pro 


Val 


10 






15 




Asp Glu Tyr Val 


Thr 


Arg 


Thr 


Asn 


25 




30 






Arg Leu Leu Thr 


Val 


Gly His 


Pro 




45 








Asn Pro Lys Lys 
60 


He 


Val 


Val 


Pro 


Val Phe Arg Val 


Arg 


Leu 


Pro 


Asp 


75 








80 


Thr Ser Phe Tyr 


Asn 


Pro 


Glu 


Thr 


90 






95 




Gly Leu Glu Val 


Gly 


Arg Gly 


Gin 


105 




110 






His Pro Leu Leu 


Asn 


Lys 


Phe 


Asp 




125 








Ala Gly Gly Pro 


Gly 


Thr Asp 


Asn 


140 










Lys Gin Thr Gin 


Leu 


Cys- 


Leu 


Leu 


155 






160 


His Trp Gly Lys 


Gly 


Ser 


Pro 


Cys 


170 






175 




Asp Cys Pro Pro 


Leu 


Glu 


Leu 


Lys 


185 




190 






Met Val Asp Thr 


Gly 


Phe Gly 


Ala 




205 








Thr Lys Ser Asn 


Val 


Pro 


Leu 


Asp 


220 










Pro Asp Tyr Leu 


Lys 


Met 


Val 


Ala 


235 








240 


Phe Tyr Leu Arg 


Arg 


Glu 


Gin 


Met 


250 






255 




Ser Gly Thr Val 


Gly 


Glu 


Ser 


Val 


265 




270 






Ser Gly Ser Thr 


Ala 


Thr 


Leu 


Ala 


285 








Ser Gly Ser Met 


Val 


Thr 


Ser 


Asp 


300 










Trp Met Gin Arg 


Ala 


Gin Gly 


His 


315 








320 


Gin Leu Phe Val 


Thr 


Val 


Val 


Asp 


330 






335 




Val Cys Ala Ala 


He 


Ala 


Asn 


Ser 


345 




350 






Phe Lys Glu Tyr 


Leu 


Arg 


His 


Gly 




365 








Phe Gin Leu Cys 


Lys 


He 


Thr 


Leu 


380 










His Ser Met Asn 


Pro 


Ala 


He 


Leu 


395 








400 


Thr Pro Pro Ser 


Gly 


Ser 


Leu 


Glu 
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405 410 415 



Asp 


Thr 


Tyr 


Arg 


Phe 


Val 


Thr 


Ser 


Gin 


Ala lie Thr Cys Gin Lys Ser 




420 










425 


430 


Ala 


Pro 


Gin 


Lys 


Pro 


Lys 


Glu 


Asp 


Pro 


Phe Lys Asp Tyr Val Phe Trp 






435 








440 




445 


Glu 


Val 


Asn 


Leu 


Lys 


Glu 


Lys 


Phe 


Ser 


Ala Asp Leu Asp Gin Phe Pro 




450 








455 






460 


Leu 


Gly 


Arg 


Lys 


Phe 


Leu 


Leu 


Gin 


Ala 


Gly Tyr Arg Ala Arg Pro Lys 


465 




470 








475 480 


Phe 


Lys 


Ala 


Gly 


Lys 


Arg 


Ser 


Ala 


Pro 


Ser Ala Ser Thr Thr Thr Pro 






485 










490 495 


Ala 


Lys 


Arg 


Lys 


Lys 


Thr 


Lys 


Lys 







500 



<210> 5 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 5 

cgtcgacgta aacgtgtatc atattttttt acag 

<210> 6 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 6 

cagacacatg tattacatac acaac 

<210> 7 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 7 

ctcagatctc acaaaacaaa atgtctctgt ggcggcctag c 

<210> 8 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 8 

gacagatctt actttttagt ttttttacgt tttgctgg 
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^ (57) Abstract: Synthetic DNA molecules encoding the HPV31 LI protein are provided. Specifically, the present invention provides 
polynucleotides encoding HPV31 LI protein, wherein said polynucleotides are free from internal transcription termination signals 

O that are recognized by yeast. Also provided are synthetic polynucleotides encoding HPV31 LI wherein the polynucleotides have 
been codon -optimized for high level expression in a yeast cell. The synthetic molecules may be used to produce HPV31 virus- 

Q like particles (VLPs), and to produce vaccines and pharmaceutical compositions comprising the HPV3 1 VLPs. The vaccines of 

^ the present invention provide effective immunoprophylaxis against papillomavirus infection through neutralizing antibody and cell- 
mediated immunity. 
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